On the Relative Importance of Toponyms in GeoCLEF

Authors

  • Davide Buscaldi
  • Paolo Rosso
Abstract

In this work we attempt to determine the relative importance of geographical terms and WordNet-extracted terms with respect to the remainder of the query. In our system, geographical terms are expanded with WordNet holonyms and synonyms and indexed separately. We tested the relative importance of these terms by multiplying their weight by 0.75, 0.5 and 0.25. Comparison with the baseline system, which uses only Lucene, shows that in some cases mean average precision can be improved by balancing the weight of geographical terms against that of the content words in the query. We also observed that WordNet holonyms may help improve recall, but WordNet has limited coverage and term expansion is sensitive to ambiguous place names.
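The weighting experiment described in the abstract can be illustrated with a minimal sketch. It assumes the weights are applied as query-time boosts using the `term^boost` syntax of Lucene's classic query parser; the abstract does not say how the authors applied them. The function name `build_weighted_query` and the example terms are hypothetical and are not taken from the paper.

```python
def quote_if_phrase(term):
    """Wrap multi-word terms in double quotes so Lucene treats them as a phrase."""
    return f'"{term}"' if " " in term else term


def build_weighted_query(content_terms, geo_terms, geo_weight):
    """Build a Lucene-style query string.

    Content words keep the default weight; geographical terms (including any
    WordNet-derived expansions) are down-weighted with the ^boost operator.
    This is an illustrative assumption, not the authors' implementation.
    """
    parts = [quote_if_phrase(t) for t in content_terms]
    parts += [f"{quote_if_phrase(t)}^{geo_weight}" for t in geo_terms]
    return " ".join(parts)


# Hypothetical query: two content words, one toponym and one holonym.
content = ["flood", "damage"]
geo = ["Italy", "Europe"]

# The three weights reported in the abstract.
for weight in (0.75, 0.5, 0.25):
    print(build_weighted_query(content, geo, weight))
```

Running one query per weight and comparing mean average precision against an unweighted baseline would mirror the comparison the abstract describes.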

Similar resources

The UPV at GeoCLEF 2008: The GeoWorSE System

This year our system was complemented with a map-based filter. During the indexing phase, all places are disambiguated and assigned their coordinates on the map. These coordinates are stored in a separate index. The search process is carried out in two phases: in the first one, we search the collection with the same method applied in 2007, which exploits the expansion of index terms by means of...

Semantic Similarities between Locations based on Ontology

Toponym disambiguation, or location name resolution, is a critical task for unstructured text, articles and documents. Our research explores how to link ambiguous locations mentioned in documents, news and articles with latitude/longitude coordinates. We designed an evaluation system for toponym disambiguation based on annotated GeoCLEF data. We implemented a node-based approach taking population ...

GeoCLEF: the CLEF 2005 Cross-Language Geographic Information Retrieval Track

Introduction. GeoCLEF is a new track for CLEF 2005. GeoCLEF was run as a pilot track to evaluate retrieval of multilingual documents with an emphasis on geographic search. Existing evaluation campaigns such as TREC and CLEF do not explicitly evaluate geographical relevance. The aim of GeoCLEF is to provide the necessary framework in which to evaluate GIR systems for search tasks involving both s...

On the reliability importance of system components

In reliability theory, measures called importance measures are introduced to evaluate the relative importance of individual components or groups of components in a system. Importance measures are quantitative criteria that rank the components according to their importance. In the literature, different importance measures are presented based on different scenarios. These measures can b...

Determining the Relative Importance of Parameters Affecting Concrete Pavement Thickness

The cost of constructing road pavements has made this subject one of the significant issues in countries' transportation infrastructure. Concrete slabs are considered a paving method capable of reducing rehabilitation needs. Therefore, to manage costs and optimize the thickness of concrete pavements, recognizing the amount of determinative factors’ influence will be requ...

Publication date: 2007